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Abstract 

Recent results have shown that structured codes can be used to construct good channel codes, 
source codes and physical layer network codes for Gaussian channels. For Gaussian channels with 
secrecy constraints, however, efforts to date rely on random codes. In this work, we advocate that 
structured codes are useful for providing secrecy, and show how to compute the secrecy rate when 
structured codes are used. In particular, we solve the problem of bounding equivocation rates with one 
important class of structured codes, i.e., nested lattice codes. Having established this result, we next 
demonstrate the use of structured codes for secrecy in two-user Gaussian channels. In particular, with 
structured codes, we prove that a positive secure degree of freedom is achievable for a large class of 
fully connected Gaussian channels as long as the channel is not degraded. By way of this, for these 
channels, we establish that structured codes outperform Gaussian random codes at high SNR. This class 
of channels include the two-user multiple access wiretap channel, the two-user interference channel with 
confidential messages and the two-user interference wiretap channel. A notable consequence of this result 
is that, unlike the case with Gaussian random codes, using structured codes for both transmission and 
cooperative jamming, it is possible to achieve an arbitrary large secrecy rate given enough power. 
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I. Introduction 

The notion of information theoretic secrecy was first proposed by Shannon [1] whereby 
a message transmitted to a receiver is guaranteed to be kept secret from an eavesdropper, 
irrespective of the computational power it possesses. In particular, it was shown that it is possible 
that the eavesdropper gains no information regarding the secret message having intercepted the 
cryptogram, albeit at the expense of very long keys [1]. Wyner, in [2], established that, more often 
than not, the eavesdropper (Eve) has a noisy copy of the signal transmitted from the source, and 
exploiting the channels, building a useful secure communication system per Shannon's notion is 
possible [2]. Csiszar and Korner [3] extended Wyner's setting to the general discrete memoryless 
wiretap channel and established its secrecy capacity. 

Numerous channel models have since been studied using the information theoretic secrecy 
framework. The maximum reliable transmission rate with secrecy was identified for some basic 
models including the Gaussian wiretap channel [4], the MIMO wiretap channel [5], [6] and 
the MIMO Gaussian broadcast channel with confidential messages [7], [8]. Generally speaking, 
secrecy capacity regions for multi-transmitter models remain as open problems, though some 
specific cases are solved, e.g., sum secrecy capacity for a degraded Gaussian multiple access 
wiretap channel [9], [10]. For other channels, upper bounds, lower bounds and some asymptotic 
results on the secrecy capacity exist, see for example [1 1]— [16] . For the achievability part, 
Shannon's random coding argument proves to be effective in these works. 

On the other hand, it is known that the random coding argument may be insufficient to prove 
capacity theorems for certain channels [17], and that the use of structured codes, e.g., lattice 
codes, is called for. This body of work suggests that using structured codes has two benefits: 
First, it is relatively easy to analyze large networks with these codes. For example, in [18], [19], 
the lattice code allows the relaying scheme to be equivalent to a modulus sum operation, making 
it easy to trace the signal over a multi-hop relay network. Secondly, the structured nature of 
these codes makes it possible to align unwanted interference, for example, in the interference 
channel with more than two users [20]-[23], and the two-way relay channel [18], [19]. 

A natural question therefore is whether structured codes are useful for secret communication 
as well, that is whether imposing a structure on the codebook carries any inherent advantage 
from the secrecy perspective. In this work, we shall answer this question positively and present 
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a structured coding scheme, for which we will be able to compute the secrecy rate. Furthermore, 
we will demonstrate that there are channel models where the secrecy rates offered by structured 
codes outperform that of simple random codes, in particular, for channel models with additive 
Gaussian noise. 

A large body of recent work point to the usefulness of lattice codes for Gaussian channels 
without secrecy [17]— [19], [21], [22]. However, computing the secrecy rate for lattice codes is 
especially challenging for Gaussian channels. The challenge arises from the manner in which 
the channel operates which is natural addition. In contrast, for a modulus channel setting, the 
analysis and the result follows in a relatively straightforward manner [24]. For example, reference 
[24] considers the wiretap channel with a cooperative jammer in a modulus channel where the 
source node uses lattice codes to send the secret message and the cooperative jammer uses lattice 
codes as a jamming signal to confuse the eavesdropper. The eavesdropper observes the modulus 
sum of the code and the jamming signal, where the sum operation is defined over a finite group. 
In this setting, the observation of the eavesdropper is independent from the secret message as 
in the case of the one-time pad in [1], and the secrecy rate computation can be done as shown 
in [24]. However, as commented in [24], in a Gaussian channel, the eavesdropper receives the 
sum of the signal from the source and the jamming signal, where the sum operation is over 
the ^-dimensional real space rather than over a finite group. The property of independence 
is therefore lost and the method in [24] no longer applies. While one might attempt to use a 
brute-force method to compute the secrecy rate for finite N, lattice codes in general offer the 
largest rate when the dimension N — ► oo, and this makes designing any such brute-force method 
impractical. 

The first contribution of this work is to solve the problem of secrecy rate computation when 
the lattice codes are employed. 

Most lattice codes for power constrained transmission have a similar structure to the one 
used in [24]. First, a lattice is constructed, which should be a good channel code for the 
noise/interference. Then, to satisfy the power constraint, the lattice, or its shifted version, is 
intersected with a bounded set, called the shaping set, to create a set of lattice points with finite 
average power. Commonly used shaping sets include the sphere or its shell [21], and in the 
case of nested lattice code, the fundamental region of a lattice [25]. The choice of the shaping 
set, in general, does not result in significant performance difference when there are no secrecy 
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constraints. However, as we show in this work, one shaping set is amenable to secrecy rate 
analysis. In particular, we prove that using nested lattice codes, the real sum of two lattice points 
from the same codebook leaks at most 1 bit of information per channel use to the eavesdropper 
regarding the value of one of the lattice points, if the other point is independently generated and 
uniformly distributed over the codebook. This result allows bounding of the equivocation rate 
and opens the door for applying nested lattice codes to achieve secrecy in Gaussian channels. 

The second contribution of this work is that we show, for a large class of two-user Gaussian 
channels with real channel gains, structured codes provide unbounded gains in secrecy rate as 
compared to randomly generated Gaussian codebooks at high SNR. 

We consider a Gaussian wiretap channel when there is an external transmitter that can help 
jam the eavesdropper, i.e., a cooperative jammer. This model is significant in that it can be 
viewed as a special case of many two-user Gaussian channel models with interference and 
secrecy constraints, including the Gaussian MAC-wiretap channel [12], the Gaussian interference 
channel with confidential messages [26] and the Gaussian interference channel with an external 
eavesdropper [27, Section VI], [16]. This channel, which is also called the interference assisted 
wiretap channel in reference [28], is interesting also because it retains the essence of the problem 
on the trade-off between secrecy and interference. It is known that introducing interference via 
a cooperative jamming node into the channel may increase the uncertainty observed by the 
adversary and hence allow for a higher rate of secret messages [12], [13], [26], [28], [29]. 
However, to harvest this benefit, the interference should be introduced in an intelligent way such 
that it is more harmful to the adversary than it is to the intended receiver of the messages. 
Hence, the key is to achieve a fine balance between secrecy against adversary and harmful 
interference to the intended receiver. The current state of art is such that the achieved rate is still 
far from the outer bounds for this model [28], despite recent efforts [30]. For example, the genie 
outer bound from [28] increases with power P at the speed of 0.51og 2 (P) [30, (69)], while the 
achievable secrecy rate converges to a constant when P — > oo [28, Theorem 2]. This means the 
gap between the achievable rate and the outer bound is unbounded and the trade-off between 
secrecy and interference is not well-understood. In fact, once the channel model is such that 
the intended receiver is not harmed by the introduced interference, the achieved secrecy rate 
immediately comes within 0.5 bits/channel use of the capacity region, as was shown for the one 
sided interference channel in [31], the orthogonal MAC wiretap channel in [27] and the two-hop 
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relay channel in [32] when the power of relay is large. 

Since the gap between the achievable secrecy rate and its outer bound is most pronounced 
at high SNR, in this work, we shall use secure degree of freedom as the performance measure, 
which shows the high SNR behavior of the achievable secrecy rate. For a fading interference 
channel, using interference alignment [33] and fading across the channel states, reference [34] 
demonstrates that the secure degree of freedom can be made positive. However, in the absence 
of channel variations, the method in [34] does not apply and as described above, reference 
[28] shows that the achieved rate using Gaussian codebooks converges to a constant as power 
increases, which implies the obtained secure degree of freedom with Gaussian signalling is zero. 
In this work, using structured codes, we prove that a strictly positive degree of freedom is 
achievable for this case, as long as the channel model is fully connected and not degraded. This 
means that with the help of only one cooperative jammer, given enough power, an arbitrarily 
large secrecy rate is achievable for a Gaussian wiretap channel. 

In obtaining our results we consider all possible channel gain configurations with a fully 
connected channel model. The signaling scheme we find that yields the largest achieved secure 
degree of freedom calls for a partial alignment scheme, in which part of the interference is 
actually aligned with the intended signal. A final note is that, in this work, for the purpose of 
completeness, we also utilize integer lattices whenever necessary, as they can achieve a larger 
secure degree of freedom for a certain set of channel gain configurations of measure zero. This 
is mainly a consequence of their simple structure, for which the equivocation can be computed 
precisely, while for the nested lattice code the equivocation can only be bounded. 

The rest of the paper is organized as follows. Section HI] describes the mathematical tool found 
to bound the secrecy rate when nested lattice codes are used. Section [III] describes the wiretap 
channel model with a cooperative jammer. Section [IV] derives the secure degree of freedom 
with nested lattice codes. Section [V] compares it with the secure degree of freedom achieved by 
integer lattice and points out the cases where integer lattice is useful. Section [VI] computes the 
secrecy rate with finite power and demonstrates the advantage of the structured coding scheme 
over random Gaussian signaling even with moderate SNR. In Section IVIH we discuss the effect 
of imperfect channel state information. Section IVIIII discusses the channel model with complex 
channel gains. Section [IX] concludes the paper. Main proofs are presented in Appendices for 
clarity. 
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II. Results on Nested Lattice Codes 

A nested lattice code is defined as an intersection of an ^-dimensional "fine" lattice A and the 
fundamental region of an A-dimensional "coarse" lattice A c , denoted by V(A C ). A,A C C H N . 
The term "nested" comes from the fact that A c C A. The modulus operation is defined as the 
quantization error of a point x with respect to the coarse lattice A c : 

x mod A c = x — arg min \\x — u\\ 2 (1) 

u£A c 

where \\x — y\\ 2 is the Euclidean distance between x and y in H N . It can be verified that AflV(A c ) 
is a finite Abelian group when the addition operation between two elements x, y E A n V(A C ) 
is defined as 

x + y mod A c (2) 
The signal X N transmitted over N channel uses from a nested lattice codebook is given by 

X N = (u N + d N ) mod A c (3) 

Here u N is the lattice point chosen from A PI V(A C ), and d N is called the dithering vector. 
Conventionally, d N is defined as a continuous random vector which is uniformly distributed 
over V(A C ) [25]. We show in Appendix O that for the channel models considered in this work, 
a fixed dithering vector can be used. Either way, the nature of d N will not affect the result 
described below. In the following, we assume u N is independent from d N . We also assume that 
d N is perfectly known by all receiving nodes, and hence, is not used to enhance secrecy. 
As will be shown later, our goal in general will be to bound the expression of the form 

I«;AT ±X»,d»,d») (4) 

which will correspond to the rate of information leaked to the eavesdropper. Here uf , Xf ,df 
correspond to the u N , X N , d N mentioned above respectively. That is to say that uf E An V(A C ); 
df is the dithering noise; Xf = (uf+df) mod A c . In addition, uf, df, i — 1, 2 are independent. 
To bound ©, we start from the following result, which we will term the representation theorem 
from here on: 

Theorem 1: Let ti,t 2 ,...,t K be K numbers taken from the fundamental region of a given 

K 

lattice A. There exists a integer T, such that 1 < T < K , and Yl, tk is uniquely determined 

fc=i 

by {T, E h mod A}. 

k=i 
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Proof: The proof is given in Appendix |A[ ■ 
Remark 1: As shown in its proof, Theorem Q] is a purely algebraic result and does not rely 
on the statistics of ti,t 2 , ...t K . □ 

When K = 2, we have the following corollary: 

Corollary 1: For Af ,i = 1,2 computed according to ©, i.e., Af = (uf + df) mod A c , 
there exists an integer T, such that 1 < T < 2^, and Af ± Af is uniquely determined by 
{T, Af ±Af mod A c }. 

Proof: Define — A c = {—x : x E A c }. Since G A c and the difference of any two 
lattice points is a lattice point, we have — A c = A c . This means that if A^ G V(A C ) , then 
-X% G V(-A c ). Since -A c = A c , this means that -X% G V(A C ) . Hence the corollary follows 
from TheoremE] by letting tf = X N and t% = ±A 2 7 
Using Corollary [TJ we have 



u 2 • 



J«;Af±Af,<,0 (5) 

=/«; Af ± Af mod Ac, T, <, <) (6) 

</«; Af ± Af mod A c , <, df ) + ff(T) (7) 

=/(< ; uf ± wf mod A c ) + H(T) (8) 

Here the T in © is the integer defined in Corollary \T\ 

Since A n V(A C ) is an Abelian group, when wf is independent from uf , and uf is uniformly 
distributed over A n V(A C ), we have [24], [35]: 

7(uf ; uf ± uf mod A c ) = (9) 

Applying it to ©, we find ([5]) is upper bounded by 

H{T) < N (10) 

Equations (l6l)-([T0l) imply 

l/(uf;Af ±Af,df,df)<l (11) 

Hence, if the eavesdropper observes Af ± Af and knows df , i — 1, 2, it can obtain at most 1 
bit information per channel use regarding the value of uf , if uf is independently from uf and 
uniformly distributed over A n V(A C ). We will use this result extensively in the sequel. 
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Fig. 1. The Gaussian Wiretap Channel with a Cooperative Jammer 

III. The Gaussian Wiretap Channel with a Cooperative Jammer 

A. System Model and Metric 

Consider the Gaussian wiretap channel with a cooperative jammer [28] shown in Figured] In 
this model, node Si sends a message W\ via X\ to node D\, which must be kept secret from 
node D 2 . Node S 2 , the cooperative jammer, sends signal X 2 . We assume the channel is fully 
connected, which means that none of the channel gains equals zero. This assumption is valid 
for the wireless medium. After normalizing the channel gains of the two intended links to 1, the 
received signals at the two receiving node Di and D 2 can be expressed as 

Y 1 = X 1 + ^X 2 + Z 1 

(12) 

Y 2 = VbX! ±X 2 + Z 2 

where Zi, i — 1, 2 is a zero-mean Gaussian random variable with unit variance, and y/a, Vb and 
Zi are real numbers. 

Let W\ be the estimate of W, estimated at node D\. For D 1 to receive W\ reliably, we require 

lim Pr (W l ^ Wi) = (13) 

n— +00 V / 

In addition, since W\ must be kept secret from D 2 , we require 

lim -H(Wi) = lim -H {WAY?) (14) 

n— >oo fl n— >oo ji 

The achieved secrecy rate R e is defined as: 

R e = lim -H(Wi) (15) 

n— >oo fl 

such that the conditions < fl"3l , (TT41) are fulfilled simultaneously. 
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Let X 1 = VbX l and Y 1 = \fbY x . Then from 432), we have 

Y l = X l + ^fahX 2 + VbZ 1 



(16) 



In the sequel, we will focus on this scaled model which will be more convenient to explain our 
results. 

There are two constraints on the input distribution to the channel model in (fT6l) : First, we 
assume there is no common randomness shared by the encoders of S\ and S 2 . This means, the 
input distribution to the channel is constrained to be 



where m is the number of channel uses involved. Intuitively, this implies that if X 2 is employed 
to send interference to confuse the eavesdropper, its effect can not be mitigated by coding Xi 
via dirty-paper coding [36]. 

Second, the average power of Xi is constrained to be Pi. If Xy is the jth component of Xi, 
this means: 



When computing the secrecy rate R e for the model in <fT6l) , we consider its n symbol extension. 
For the nested lattice code, n corresponds to the dimension of the lattice N. For the integer lattice 
code, n = 1. When we design the coding scheme, we make sure that the signals transmitted 
by node S 2 for every n-channel use block are independent from the signals transmitted by it 
during the other blocks. Hence, if every n channel uses are viewed as a single channel use, the 
channel is in effect a memoryless wiretap channel. This allows us to leverage the following result 
from reference [3]: Consider a memoryless wiretap channel Pr(Y, Z\X), where X is the channel 
input, Y is the observation of the legitimate receiver, Z is the observation of the eavesdropper. 
Then for a given input distribution Pr(X), any secrecy rate R e such that 



is achievable. The notation [x] + equals x if x > and otherwise. 

Adapting the result to our model, we observe that X, Y, Z corresponds to X™, Y™, Y£ respec- 
tively. Hence (fT9l) takes the following form: 



P pC,^i)p(x 2 m ) 



(17) 




(18) 



0<i? e < [I(X;Y)-I(X;Z)} 



+ 



(19) 



10 




Fig. 2. The Gaussian Wiretap Channel with a Cooperative Jammer as a special case of the Multiple Access Channel with an 
external eavesdropper, i.e., the MAC-WT [12]. 

Theorem 2: For a given n letter input distribution PrpTf), with the assumption that the 
signals transmitted by node S2 for every n-channel use block are independent from the signals 
transmitted by it during the other blocks, any secrecy rate R e such that 

< Re < -[/W; 1?) - / W; KT)]+ (20) 
is achievable for the model in (fT6l) . 

It is important to note that (fT9l) is proved with a random coding argument using joint typicality 
decoder. However, as we will show later, here the support of the input distribution Pr(X") is 
restricted to the lattice points and hence structure is imposed on the codebook. 

Remark 2: Note that even though (fT9l ) was originally derived for a channel model with discrete 
alphabets, the same result readily holds for a channel model with continuous alphabets with 
average power constraint at the transmitter [5, Section IV]. □ 

In this work, we will mainly be concerned about the high SNR behavior of the secrecy rate. 
Namely, 

Definition 1: The secure degree of freedom of the secrecy rate is defined as: 

R 

s.d.o.f. = limsup — (21) 

B. Relationship with Other Wiretap Channels 

The significance of the Gaussian wiretap channel with a cooperative jammer is that it can 
be considered as a special case of a large class of channel models with secret messages, as 
explained in the following: 



1 1 




Fig. 3. The Gaussian Wiretap Channel with a Cooperative Jammer as a special case of the two-user Interference Channel with 
confidential messages, i.e., IFC-CM [26]. 



z 




Fig. 4. The Gaussian Wiretap Channel with a Cooperative Jammer as a special case of the Interference Channel with an 
external eavesdropper, i.e., IFC-WT [27]. 



1) As shown in Figure [2] if node S 2 has a confidential message W 2 for D 1 , which must be 
kept secret from D 2 , then the channel is the MAC-wiretap channel [12]. 

2) If node 5*2 has a confidential message W 2 for D 2 , and the message must be kept secret 
from Di, then the channel is the interference channel with confidential messages [26]. 

3) As shown in Figure SI we can add another receiving node D to Figure [H to which node 
52 wants to sent a confidential message W 2 . Again W 2 must be kept secret from D 2 . Then 
the channel becomes the interference channel with an external eavesdropper [27, Section 
VI] [16]. 

Hence, it is clear that, any secrecy rate achieved in the Gaussian wiretap channel with a coopera- 
tive jammer is an achievable individual rate for all the three multi-user channels mentioned above. 
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An achievable secrecy rate region can be obtained by time sharing between these individual rates. 

C. Gaussian Signaling 

Reference [28] derived an achievable rate with Gaussian codebooks and power control for the 
channel in Figure CD It was shown, in the same reference that, with a Gaussian codebook, the 
achievable secrecy rate R e converges to a constant when the power constraints of node D x and 
D 2 goes to oo. [28, Theorem 2]. This result implies that the achieved secure degree of freedom 
is 0. It should be noted that in the achievable scheme of [28], node D 2 sends a sequence from 
a codebook randomly generated in an i.i.d. fashion according to a Gaussian distribution. This 
is the worst noise from the eavesdropper's (D 2 ) perspective if Gaussian i.i.d. signaling is used 
in X±, see, for example, [37, Lemma 2]. However, since the channel is fully connected, X 2 is 
also the worst noise for the intended receiver D x . This effect causes the secrecy rate to saturate, 
leading to zero secure degree of freedom. 

A moment's thought reveals that if we allow structure in X 2 , while the eavesdropper may 
potentially get more information about X\, the intended receiver Di may benefit more by 
exploiting the structure of the interference and neutralizing it more effectively as compared 
to the eavesdropper. Hence, the overall secrecy rate can be improved. In the following, we will 
show that this intuition is correct and use structured codes to derive the achievable secure degree 
of freedom for the Gaussian wiretap channel with a cooperative jammer. 

D. Main result 

Theorem 3: For the channel model in Figure CD a positive secure degree of freedom is 
achievable when the following condition holds: 

1) The channel is fully connected. Hence no channel gain is zero. 

2) The channel is not degraded [10] or reversely degraded. This means the signal received 
by the intended receiver can not be expressed as a degraded version, see [10, Section II], 
of the signal received by the eavesdropper, and vice versa. 

The theorem is proved by considering different range of values for \fab in (fT6l) . For almost 
all values of \fab, Theorem [3] can be proved using the nested lattice coding scheme, which will 
be shown in Theorem @] and Corollary [2l However, there is still a set of channel gains which 
has measure zero, where the secure degree of freedom achieved by Theorem |4] is zero. For these 
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special cases, we shall use integer lattices to achieve a positive secure degree of freedom. These 
cases will be discussed in Section |V] 

IV. Achievable Secure Degree of Freedom with Nested Lattice Codes 
We notice that any y/ab, y/ab =/ 0, can be represented in the following form: 

\fah = p/q + j/q (22) 

where p, q are positive integers, and — 1 < 7 < 1, 7 ^ 0. In this case, the channel model (fl6l) 
can be expressed as: 

qY 1 = qX x + {p + -f)X 2 + qVbZ 1 (23) 
Y 2 = X l ± X 2 + Z 2 (24) 

Using this notation, we have the following theorem regarding the achievable secure degree of 
freedom: 

Theorem 4: The following secure degree of freedom is achievable using nested lattice codes 
when < I7I < 0.5: 

"0.25 log 2 (a)-l' ' 



±log 2 (a/3 + 1) 



(25) 



where 



1 - 2 7 2 + yr^w n „ 

a = — 4 (26) 

and 

(3 = q 2 + (p + lf (27) 



Proof: The proof is given in Appendix [Bj ■ 
Remark 3: In the proof of Theorem HI we use a layered coding scheme. That is to say that the 
transmitted signal X^ , k — 1, 2 is the sum of codewords from M layers as shown in (|69l) , where 

in (|69l ) is the signal sent by the Sk in the zth layer. A nested lattice pair (Aj, A c 4 ), A c $ C Aj, 
is assigned for each layer. X^ is computed from a lattice point from Aj n V(A c i ) as shown in 
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Layered coding scheme with lattice codes was first used in [38] for a J^-user interference 
channel (K > 3) without secrecy constraints. The difference here from [38] is as follows: In 
[38], a sphere shaped lattice code is used for each layer. Here, for each layer, a nested lattice 
code is used instead. As a result, the corresponding decoding algorithm and error probability 
analysis is different. Reference [38] uses the results from [39]. The rate derivation in our work 
follows reference [25]. □ 

Remark 4: In the proof of Theorem HI we use the result from [3], i.e., ([201) , in (|97l ). X™ in 
(1201) corresponds to the lattice points from all layers u^, i = M, as shown in ([97l ). □ 

Remark 5: When computing the secrecy rate, we assume the eavesdropper knows (i) the 
channel noise, as shown by (11071) . and (ii) the contribution to the received signal from each 
layer, as shown by (11081) . For each layer, the information leaked to the eavesdropper takes the 
form ©. This can be seen from (11081) in the proof of Theorem 0] Hence with M layers, the 
overall information leaked to the eavesdropper is bounded by M bit per channel use, with each 
layer leaking at most 1 bit per channel use. □ 

A consequence of Theorem |U is as follows: 

Corollary 2: For y/ab, such that 2\fab is not an integer and 1/y/ab is not an integer, the 
secure degree of freedom given by Theorem |4] is positive. 

Proof: We first verify Corollary |2] holds for interval 1 < \fab~ < 2. The value of (1231) is 
plotted Figure [5] in 1 < \fah < 2. To prove that (1231) is positive in this range, it suffices to choose 
(p — 1, q — 1), (p — 2, q — 1), (p — 3, q — 2), and let \[ab = p/q + j/q. A higher secure degree 
of freedom can be achieved by choosing other values for p, q, but for clarity, these curves are 
not plotted in Figure \5\ 

Note that for a fixed pair of p, q, by changing 7, (1231) takes the shape of a spur. Hence Figure [5] 
includes three such spurs. The value of (|25T) converges to 0.5 when 7 converges to 0. However, 
since 7 / 0, the peak of the spur is not included. Hence a positive secure degree of freedom 
cannot be gauranteed by Theorem |4] only when \fab = 1, 1.5 or 2. 

We next argue that Corollary [2] holds for interval n < \fab~ < n + 1 for all integer n, n > 1. 
This follows from the fact that the denominator of (1231) is always positive. Hence the positivity of 
(1231) is only determined by its numerator, which is only a function of 7. When n < \[ab < n + 1, 
we can simply choose the following three pairs of (jo, q): (p = n, q = 1), (p = n + l,q — 1), 
and (p = 2n + l,q = 2). The positivity of (1231 ) in this interval [n,n + 1] should be the same 
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Fig. 5. The value of i25\ when 1 < Vab < 2. The red dashed line is the contour. 

as the positivity of (|25T) in interval [1, 2]. Since ([25]) is verified to be positive in interval [1,2] 
except for the cases stated in Corollary [2l the same case holds for interval [n, n + 1]. 

We next verify Corollary [2] holds for the interval 1/2 < yah < 1. The value of (|25T) is plotted 
in Figure [6] The two spurs in Figure [6] follows from choosing p — 1, q — 1, and p = 1, q — 2. 

Finally we consider the interval l/(n + 1) < < 1/n for n > 1. For this interval, we 
can choose p — l,g = n + 1 and p — l,q — n. Again, since the positivity of (|25l) is only 
determined by 7, Corollary [2] holds for this interval [l/(n+ 1), 1/n] since it holds for interval 
1/2 < ylti < 1. 

Hence we have completed the proof of Corollary [21 

■ 

Remark 6: It should be noted that when 2\fab or l/\fab is an integer, and \fah is not equal 
to one of {0.5, 1, 1.5, 2}, we can achieve a positive degree of freedom using nested lattice codes 
also. This entails a different way to align the interference. This alignment scheme is not described 
in detail because the secure degree of freedom achieved by integer lattice codes is better; see 
also Remark |9j □ 

V. Comparison with Integer Lattice Codes 

Another frequently used class of structured codes is the integer lattice codes [20], [23]. In 
this section, we compute the secure degree of freedom using integer lattice codes and compare 
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Fig. 6. The value of i25\ when 1/2 < Vab < 1. The red dashed line is the contour. 

it with that of the nested lattice codes from Section [IV] 

A. Integer Lattice Codes 

An integer lattice code with parameter Q is composed of points in the set [0, Q)nZ where Z is 
the set of all integers. The point can be scaled and shifted to obtain the actual transmitted signal. 
Due to its simple structure, unlike nested lattice codes, for this code the rate of information 
leaked to the eavesdropper can be computed precisely (rather than lower bounded). As will be 
shown later, the information leaked to the eavesdropper can be bounded by the sum of several 
terms, each with the following form: 

f(Q) = I(X 1 ;X 1 ±X 2 ) (28) 

where Xi, i — 1, 2 is uniformly distributed over [0, Q)(~]Z. f(Q) can be bounded by the following 
lemma: 
Lemma 1: 

1 1 1 1 7re 

f(Q) < - log 2 (27re(- - — )) < - log 2 ( y ) < 0.8 (29) 
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Proof: Equation ([291 ) follows directly from [23, Lemma 12]. H(X 1 + X 2 ) can be bounded 



as 



H(X 1 + X 2 )<Uog 2 (2ne(2P' + ±)) 



(30) 



where P' is the variance of X k , k — 1, 2, which is given by 

2 



P' 



1 



fc=0 



Q-l 
fc=0 



Q 2 



i 



12 



(3D 



Substituting d3T) into dH we get ([29]). ■ 
As shown by (|29l . the leakage rate is smaller than the 1-bit bound for the nested lattice code, 
which suggests that integer lattice codes may be useful for cases where the secrecy rate offered 
by the nested lattice code is 0. 

Remark 7: Compared to the nested lattice code, the drawback of the integer lattice code is its 
energy inefficiency at finite power. Moreover, typically integer lattices rely on certain number 
theoretical results. Hence, they only yield good (secure) degree of freedom with very specific 
channel gain configurations. Luckily, these channel gains happen to be the cases not covered by 
nested lattice codes in Corollary [2l □ 



B. Secure Degree of Freedom 

1 ) When \fab is algebraic irrational: When \fab is algebraic irrational, we observe that the 
achievable secure degree of freedom can be derived using a result from [23]: 

Theorem 5: A secure degree of freedom of 1 /2 is achievable when y/ab is an algebraic 
irrational number. 

Proof: The proof is given in Appendix [Ql ■ 
Remark 8: When \fab = 1 and all channel gains are positive, the channel is degraded and 
from the outer bound in [28], the secure degree of freedom is 0. Since algebraic irrational 
numbers are dense on the real line, it follows that the secure degree of freedom is discontinuous 
at \fab = 1. □ 

Theorem \5\ applies only when \fab is algebraic irrational, which is a set of measure on the 
real line. Hence other schemes are needed to cover the case where \fab is either rational or 
transcendental. 
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2) When y ab > 2 or 1/yab > 1/2: For this range of yab, we find Q-bit expansion scheme 
similar to the one in [20] is useful in deriving the secure degree of freedom. 

Theorem 6: Let Q = yfab if yah > 2. Otherwise, let Q = \j\fah. Let [Q\ denotes the 
largest integer smaller than or equal to Q. Let Q > 2. Then following secure degree of freedom 
is achievable: 

~iio g2 LQj f([Q\y 



2 log 2 Q 21og 2 Q 
where f(Q) is defined in (l28~l) . (l32l is lower bounded by 

1 log 2 [Q\ log 2 (2^ (§) - ^ 



(32) 



(33) 



2 log 2 Q 41og 2 (Q) 
If Q = 2, OH) equals 0.25. 

Proof: The proof is given in Appendix [0 ■ 

In Figure [/J we plot the secure degree of freedom achieved by Theorem [6l The actual 
performance curve in Figure [7J corresponds to (|32~1) . The lower bound curve corresponds to 
( 1331 . The zigzag shape of the curve is a consequence of the [ J operation on Q in (|32l . We 
notice as \fab moves away from 1, the lower bound given by (|33l becomes tighter, and the 
secure degree of freedom converges to 0.5. 

In Figure [H we compare the secure degree of freedom achieved by nested lattice code with 
those achieved by integer lattice coding scheme described in this section. As shown by Figured! 
neither scheme dominates the other in performance for all channel gains. Nested lattice code 
offers good secure degree of freedom near the peak of spurs, while integer lattice coding scheme 
described in this section is at advantage when 2\fab are integers and \fab~ > 2. 

Remark 9: A coding scheme similar to the one described in this section can be constructed 
with a nested lattice code. However, since we can only bound the secrecy rate rather than compute 
precisely the rate of information leaked to the eavesdropper as we did for the integer lattice code, 
the provable secure degree of freedom turns out to be smaller. □ 

3) When y/ab = 1: When Y 2 = X\ + X 2 + Z 2 , the channel is degraded. The secure degree 
of freedom is known to be [28]. 

However, when Y 2 = X\ — X 2 + Z 2 , we have the following result: 

Theorem 7: When Y 2 = X\ — X 2 + Z 2 , a secure degree of freedom of 0.0548 is achievable. 
Proof: The proof is given in Appendix [0 ■ 
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10" 3 10" 2 10" 1 10° 10 1 10 2 10 3 

(ab) 05 

Fig. 7. Secure degree of freedom achieved by integer lattice coding scheme in Section IV-B2I 

4) When yah = 1.5: Here we mimic the coding scheme in Section HVl when p = l,q — 1. 
Note that positive secure degree of freedom can not be achieved by the nested lattice code in 
Section [IV] in this case. This is mainly a consequence of leaking 1 bit per layer in the layered 
coding scheme. Here, we replace the nested lattice code with Q-bit expansion similar to the one 
used in Section IV-B2I and leverage the fact that integer lattice leaks less than 0.8 bit per layer 
to show that a positive secure degree of freedom is achievable. 

Theorem 8: When \fab = 1.5, a secure degree of freedom of 1/6 is achievable. 

Proof: The proof is given in Appendix ■ 

Remark 10: This scheme can be extended to the case when \fab = 1 + 1/Q with Q being an 
integer greater than 2. In this case, we let a^; = if i mod 2 = 1. Otherwise a^, is taken from 
[0, Q — 1] fl Z. However, to make room for the carryovers, the least significant bit of the binary 
representation of a k i must be zero. Hence, 1 bit is lost per layer due to the carryovers. As a 
result, the achieved secure degree of freedom turns out to be smaller than those achieved with 
nested lattice codes. Similar coding schemes can be designed for other values of \fab. However, 
it is difficult to find a uniform description of such codes that achieves a better performance than 
that of nested lattice code. □ 

Remark 11: At this point, given the description of all these coding schemes using structured 
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0.5 1 1.5 2 2.5 3 3.5 4 



(ab) 1/2 

Fig. 8. Comparison of secure degree of freedom achieved by nested lattice code (solid blue) and integer lattice coding scheme 
in Section IV-B2I (dashed red) 

codes, the careful reader might find it interesting to ask whether a Gaussian signaling scheme 
can be devised by mimicking these schemes. We notice this would be difficult for the schemes in 
Section IV-B II or Section [TV] since Section IV-B 1 1 relies on [23] which uses results on Diophantine 
approximation, and Section [IV] uses partial interference alignment which is difficult for codes 
without a structure. We are then left with the schemes in Section IV-B2I and Section IV-B3I 

In Sections [V-B2I and IV-B31 we use multi-level coding schemes. When computing the secrecy 
rate, it is assumed the eavesdropper has perfect knowledge of the channel noise Z 2 and the 
part of received signals from each layer. Note that if a random codebook is used in this model 
for each layer, revealing Z 2 to the eavesdropper would essentially reveal to the eavesdropper 
the codeword used at each layer. This can be seen by observing that the capacity region of a 
2-user Gaussian MAC channel will expand to the whole two dimensional plane if the variance 
of the channel noise goes to 0. On the other hand, if Z 2 is not revealed to the eavesdropper, 
then it would be difficult to compute the equivocation rate if a multi-level coding scheme is 
used. Suppose, to solve this problem, we allocate a small amount of power at each layer to send 
noise. Doing so would then cause problems at the intended receiver, since the noise can not be 
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Fig. 9. Secrecy Rate with Finite Power, a — b = \f\, Ptotai is the power of Xi or X2 in i Kit 



decoded and subtracted as the decoder proceeds from higher layers to lower layers. Therefore, 
we conclude, even if a Gaussian signaling scheme could be eventually found, it would not be a 
straightforward extensions from any of the coding schemes described in this paper. □ 

VI. Finite Power Scenario 

In practice, it is also interesting to look at how fast the asymptotic behavior, i.e., the secure 
degree of freedom, kicks in as the power increases. In Figure |U we plot the secrecy rate against 



power when a = b = </-. The power P to tai is the variance of X 1 or X 2 in (fT6l) . 



Since yab — y | is algebraic irrational, the integer lattice coding scheme from Section IV-B 1 1 
provides the largest secure degree of freedom, which is 0.5. We use (11341) to compute its secrecy 
rate. 

From (13TI) and (11251) . the average power of this coding scheme is given by 

.Q 2 -l 



total 



pl/2+2e] 



12 



(34) 



The first term in (|34|) is due to the scaling factor P 1 / 4+e in (11251) . We then choose different values 
for e and plot the largest achievable power rate pair region in terms of {10 log 10 Ptotai, Re} m 
Figure |U 
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For comparison, we plotted the largest secrecy rate offered by the Gaussian random codebook 
when the power constraints go to oo. Recall that when the power constraints go to oo, the secrecy 
rate is not affected by whether the channel is described by (fT2l) or ([TBI , which only affects the 
power constraints. To leverage the result from [28], we use the channel model description in 
(fl"2~l) . Since a = b = ^2/3 < 1, (fT2l) corresponds to the second case in [28, (11)11 The power 
unconstrained secrecy rate offered by the Gaussian random codebook is given by 

_ lim R e = - log 2 — (35) 

p fe ^oo,/c=i,2 2 ao 

As shown by Figure [9l the secrecy rate offered by integer lattice is greater than the power 
unconstrained Gaussian signaling scheme when 10 log 10 Ptotai > 30. 

We also plot, in Figure [9l the secrecy rate offered by the nested lattice code, by choosing 
different p, q and the number of layers M . The secrecy rate is computed according to (II 1 1 1) . Ptotai 
is given by (l96l) . As expected, different choices of p, q results in different slope of the curve, 
hence different secure degree of freedom. When 101og 10 P to4a ; < 60, the nested lattice code can 
achieve a larger secrecy rate than integer lattice code due to its power efficiency. However, if 
Ptotai keeps increasing, the secrecy rate offered by the integer lattice is the largest, due to its 
high secure degree of freedom. 

To summarize, these results demonstrate that the coding scheme presented in this paper can 
outperform random Gaussian signaling at practical SNR values as well. 

VII. Channel Gain Mismatch 

The coding schemes in previous sections require aligning lattice points at the eavesdropper, 
which relies on accurate channel state information. It is conceivable that in practice such accurate 
channel state information may be difficult to obtain and it is reasonable to ask if the nested lattice 
coding scheme is still able to provide secrecy rate with imperfect channel state information. In 
this section, we explain how to compute the secrecy rate in that case. 

The channel model is shown in Figure \T0\ This is the same channel model we have been using 
except now, the channel gain between Si and D 2 has an estimation error oti for the ith channel use. 
For simplicity, we assume the remaining channels are estimated perfectly. The channel estimation 



1 [28] only considers the case Y2 = Xi + X2 + However, since the Gaussian distribution is symmetric around zero, the 
case I2 = X\ — X2 + Z2 has the same secret rate with Gaussian input distributions. 
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Fig. 10. Wiretap Channel with a Cooperative Jammer S2, where the channel gain between Si and D 1 has an estimation error 
on £ [— Omax, a m ax] for the ith channel use. oli is known by D 2 but not known by Si or S2. 



error, is independent from the signal transmitted by node Si, and ccj E [— a max , ct max ]. 
«max > 0. We also assume the eavesdropper, node D 2 , has perfect knowledge of c^, but the 
other nodes only know a max . In this setting, perfect channel state information at the eavesdropper 
is obviously a pessimistic assumption, but it is insightful in that it will reveal a worst case 
performance. 

With these assumptions, we have the following theorem: 

Theorem 9: Let Pi be the total power consumption of Si. If for a constant c > 0, 

«Lx < 4" (36) 

Then the some secure degree of freedom given by Theorem @] is achievable. 

Proof: The proof is given in Appendix [H] ■ 
Remark 12: Imperfect channel state information for other links, for example, on the link 
between legitimate transmission pair 5*1 and Di can be easily catered for, by using the existing 
results for the model without secrecy constraints [40, Section IV]. □ 

VIII. Complex Channel Gains 

The role of complex channel gains in interference alignment has recently been considered [41]. 
For the purpose of completeness, in this section, we consider the channel model with complex 
channel gains. We demonstrate that, in this case, by taking advantage of the phase difference 
existing in the complex channel model, it is easier to achieve positive secure degree of freedom 
using simple random coding arguments. 
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The complex Gaussian wiretap channel with a cooperative jammer is shown in Figure QT] 
Since the channel is fully connected, after normalization, the signals received by the two nodes 
Di and D 2 can be expressed as 

Y 1 = X 1 + y/Ee>^X 2 + Z x 

(37) 

Y 2 = Vbe^X, + X 2 + Z 2 

where j = Zi, i — 1, 2 is a rotation invariant complex Gaussian random variable with unit 

variance. This means E[\Zi\ 2 } = 1. Let X x = \/be^ h X x and Y 1 = \fbe^ h Y x . Let ip = ip a + ip b . 
Z t = eP^Zx. Z 2 = Z 2 . Then from 03) we have 

Y l = X l + Vd>^X 2 + VbZ x 

(38) 

^2 = X\ + ^2 + Z 2 

The input distribution to this channel is constrained to be of the form given in (fTTl) . The power 
constraints of this channel are given by (TT8l) . The secrecy rate and secure degree of freedom is 
defined as in Section IIII-AI Given these, we have the following theorem: 

Theorem 10: A secure degree of freedom of 1 is achievable if tp ^ or it mod 2% 
Proof: Let ImXj = 0,i = 1,2. Let cot a; = cosx/ sinx. Then since Iml2 = \t&Z 2 , \yoY 2 
does not provide any information about W\ to the eavesdropper. Hence we can assume the 
eavesdropper receives ReF 2 only. Node D\ computes g(Yi) = ReYi — cot^Imli. Then the 
channel can be expressed as 

g(Y l ) = ReXx + Vb (ReZ 1 - cot iplmZx) (39) 
ReF 2 = ReXx + ReX 2 + ReZ 2 (40) 
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By transmitting i.i.d. Gaussian noise via ReX 2 , the channel is equivalent to a Gaussian wiretap 
channel. It is thus known that the following secrecy rate is achievable [4]: 



(41) 



(6csc 2 ^)/2/ \P 2 + l/2 / 
where (b esc 2 ip)/2 is the variance of the noise term \/b (ReZi — cot ipImZi). C(x) = ~ log 2 (l + 
x). Pi, % = 1,2, is the average power consumed by node Si. Let Pi — Pi, i — 1,2 and apply 
the definition in (1211) . we observe that a secure degree of freedom of 1 is achievable for this 
channel. ■ 
Remark 13: It is understood that if log 2 (^J2 Pij is use d instead of |log 2 (^J2 Pi) m me 
definition of secure degree of freedom (1211) . the achieved degree of freedom should be changed 
accordingly from 1 to 0.5. □ 

Remark 14: Since the set of complex numbers is isomorphic to the set of scaled 2x2 
orthogonal matrix, the result here can be interpreted as Gaussian codes being able to achieve 
positive secure degree of freedom for a wiretap channel with a cooperative jammer where each 
node has 2 antennas, the channel has real channel gains, and the channel matrix is a scaled 
orthogonal matrix. □ 

IX. Conclusion 

Structured codes were shown recently as a useful technique to prove information theoretic 
results. In this work, we showed that structured codes are also useful to prove secrecy results. We 
first provided an analytical tool that is useful in computing secrecy rate for nested lattice codes. 
Then, using integer lattice codes and nested lattice codes, we prove that a positive secure degree 
of freedom is achievable for the Gaussian wiretap channel with a cooperative jammer and real 
channel gains as long as the channel is not degraded and the channel is fully connected. Since 
the channel is a special case of the MAC wiretap channel, the two-user interference channel with 
confidential messages, and the two-user interference channel with an external eavesdropper, the 
result means the secure degree of freedom for all these channels are positive as well as long as 
the channel is not degraded and is fully connected. As a consequence of this high SNR result, we 
are able to claim that, Gaussian signaling is not optimal for a large class of two-user Gaussian 
channels. 
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The result here provides further evidence that structured codes are useful. A list of examples 
that structured codes outperform simple random coding arguments in non-secrecy problems can 
be found in [17]. Employing structured codes in secrecy problems was first proposed by the 
authors [42]. Up to date structured codes are found to be useful for relay channels due to the 
possibility of compute-and-forward [17], [42], or for interference channels with more than two 
users due to the possibility of interference alignment [23], [38], [43]. The result here provides 
the example that structured codes are useful for two-user Gaussian channels as well. 

An added practical value of our result is that it implies that the cooperation of just one node 
is sufficient to achieve an arbitrarily large secrecy rate given enough power. Since large scale 
cooperation involving multiple nodes is not essential, this fact enhances the robustness of the 
network in an adverse environment. 



Appendix A 
Proof of Theorem CD 

Let V be the fundamental region of the A^-dimensional lattice A. For any set A, define a A 
as a A = {ax : x E A}. Then we have: 

K 

{Y,tk:t k eV,k = l...K} = KV (42) 

k=l 

By definition of the modulus A operation, we have 

K K 

tk mod A = ^t fc + x, x E A (43) 

k=l k=l 



Therefore, the theorem is equivalent to finding the number of possible x in equation (1431) for a 

K 

given tk mod A such that t k G V,fc = 1...K. This, by (142)) . implies: 

fe=i 

K 

^2 t k mod A - x E KV, x E A (44) 

k=l 

To do that, we need to know a little more about the structure of lattice A. Each point in a lattice, 

N 

by definition, can be represented in the following form [44]: x — J2 a i v ii v i £ R N ,o,i E Z. 
{<2j} is said to be the coordinates of the lattice point x under the basis {fj}. 

Based on this representation, we can define the following relationship: Consider two points 
x, y E A, with coordinates {a^} and {bi} respectively. Then we say x ~ y if at = foj mod K, 
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i = 1...N. It is easy to see the relationship ~ is an equivalence relationship. Therefore, it defines 
a partition over A. A partition defined in this way has the following property: 

1) Depending on the values of aj — &j mod K, there are K N sets in this partition. 

2) The sub-lattice A'A is one set in the partition. The remaining K N — 1 sets are its cosets. 
Let Cj denote any one of these cosets or KA. Then Cj can expressed as Cj = A' A + G A. 
It is easy to verify that {x + KV, x G Cj} is a partition of ATT?.^ + y-i, which equals 1Z N . 

We now return to solve (|43l) . Since Cj, i = l...K N is a partition of A, (l43l) can be solved by 
considering the following A^ equations: 

K 

t k mod A - x G KV, x G Cj (45) 

fc=i 

From (l42l) . this means J] i fc mod A G x + KV for some x G Cj. Since {x + KV, x G Cj} is 

fc=i 

a partition of 1Z N , there is exactly one x G Cj that meets this requirement. This implies for a 

if 

given X) ^fc mod A, and a given coset Cj, (1451) only has one solution for x. Since there are 

k=l 

K N such choices of Cj and hence such equations as shown in (|45T) . (1431) has at most A^ 

x if 

solutions. Hence each m od A corresponds to at most K points of tk- This completes 

fc=l k=l 

the proof of the theorem. 

Appendix B 
Proof of Theorem @] 

A. Some Useful Results on Decoding Nested Lattice Codes 

In this section, we introduce some supporting results adapted from [25] which we will use in 
proving Theorem HI 

Consider K + 1 A^-dimensional lattices A P j,z = 0, K, such that A P j C H N ,i = 0, K 
is Rogers-good for covering and Poltyrev-good for channel coding. The definition for a lattice 
to be Rogers-good can be found in [45, Section II.B]. The definition of Poltyrev-good can be 
found in [45, Section III.D]. Reference [45] gaurantees the existence of A Pi i,i = 0, K which 
are good in the above sense. 

Construct the fine lattice A as in [25, Section 7] such that A Pj0 C A. Hence {A, A P;0 } forms 
a nested lattice pair. 

Define independent random variables U$ , U^, ...Uft, such that , i = 0, K is uniformly 
distributed over the fundamental region of A P)i . 
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Define a 2 (Ui),i = 0, K as the variance per dimension of . When N increases, we scale 
A and A Piil i = 0, ...,K such that a 2 (Ui) remains unchanged. Note that scaling does not affect 
the goodness of a lattice. 

Define Of,i = 1, ...,K, as zero mean Gaussian random variables such that 

Of ~A/"(0,(x 2 (Of)I) (46) 

where I is an iV x iV identity matrix, a 2 (Of) is a scalar chosen as the the variance per dimension 
of a random variable uniformly distributed over the smallest ball covering V(A Pti ). 
Define e (A P;i ) is as in [25, (67)]: 

e (A P)i ) = log (^j + \ log 2neG* N + i (47) 

where R U:i , Rn are the covering radius and effective radius of A P:i respectively. G* N is the 
normalized average power of A^-dimensional sphere and converges to as N — ► oo. Since 
A p j is good for covering, we have — > 1 as N — > oo [25, Section 7]. Hence 



lim £ (A p <) = (48) 



a (Of) is bounded by [25, Lemma 6]: 

2 



^ W) <^ (0f) <(^j ^) (49, 
Hence we have 

lim a 2 (Of) = a 2 (U % ) (50) 

Define Z N as a A^-dimensional vector which is composed of zero mean i.i.d. Gaussian random 
variables, each with variance a 2 . Z N is independent from Uf,i = 1, K. 

Define t N as a lattice point in A n V(A Pi0 ). t N is independent from Z N and Uf, i — 1, .fT. 
Define Y N as 



\ 8=1 



Y» = t n + V [/f + Z JV mod Ap, (51) 



for K > 1, and 



when K = 0. 



= (t 7V + Z 7V ) mod Ap, (52) 
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Define t N as the value for t N decoded from Y N using an Euclidean distance decoder: 



N 



t 



■N 



arg mm 

u N eAnV(A p ,o) 



Y 



N u N 



Define the rate R of the nested lattice code book A n V(A Pj0 ) as 

Ro = log 2 |V (A P)0 ) n A| 

where \S\ is the cardinality of a set S. 

With these notations, we have the following results: 
Lemma 2: If K = and hence Y N is given by (l52l) . and 

Then for each N dimensions there exist lattices A, A p such that 

Pr (Z N i V(A)' 



(53) 



(54) 



(55) 



(56) 

decreases exponentially fast with N. 

Proof: The lemma follows by repeating the proof of [25, Theorem 5] when we choose the 
scalar a defined therein to be 1 and consequently N" defined therein becomes N" = (1 — a)U + 
aN = N. ■ 
Lemma [2] can be extended into the following form: 

Lemma 3: When K > 1, and hence Y N is given by (IBTI) . and 

/ \ 

o 2 (Uo) 



R < \ lQ S2 



K 



\ 



(57) 



7 



a 2 + E ct 2 (Ui 

i=l 

Then for each N dimensions there exist lattices A, A Pj j, t = 0,...K such that Pr(t N ^ t N ) 
decreases exponentially fast with N. 

Proof: Let fx (x) denote the probability density function of any continuous random variable 
X. We first use the following fact shown in [25, (200)]: 

fu» (*) < e N<Ap ' i] fo» (x) (58) 
which "approximates" Vf with the Gaussian random variable Of. e (A Pji ) is defined in (|47T) . 
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From ( |58l ), since Uf , i = 1,...,K, Z N are independent, we have 



f K (x) < e *=i f K (x) 

E +z" E of +z* 



(59) 



which means: 



Pr (E^f + Z^V(A)) 

Pr (j20? + Z N tV(A)) 

K 

From Lemma [2l since E Of + Z N is an i.i.d. Gaussian vector, we have, for a given K, 

i=i 



ATE £ (A P ,i)_ 

<e i=1 



(60) 
(61) 



(62) 



decreases exponentially fast with iV if 



< - log 2 



° 2 (U ) 



K 



a 2 + lim,^ E a 2 (Of; 



(63) 



which yields (l57l) because of (l50l) . This, along with (l48l) . means (|6TT) decreases exponentially 
fast with iV. Therefore (1601 ) decreases exponentially fast with N. Since (1601 ) is an upper bound 
on Pr(t iV ^ t N ), we have proved Lemma [3] ■ 
The following result is adapted from [25, (89)]. 
Lemma 4: Define fi as 



o 2 (U ) 



cr 2 + E cr 2 W 
i=l 



(64) 



Then if fj, > 1, the probability 



Pr f/f + mod A Pi0 ^ £ U? + 

\i=l i=l 



(65) 



decreases exponentially fast with respect to N. 
Proof: (1651) is upper bounded by 



Pr (X>f + Z N tV(A Ptl 
\i=i 



(66) 
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which, in turn, by following similar steps which lead to ( |6TT) , is upper bounded by 

K 

e - Prfeof + Z N ^V(A pja )j (67) 

Since A p0 is taken from the lattice code ensemble defined in [25, Section 7], according to [25, 
(78)], we have 

Pr (E °f + Z N i V (A p , )) < e-^^W-^W) (68) 

where Ep(fi) is the Poltyrev exponent defined in [25, (56)]. on (1) is any function of A" such 
that limTv^oo o N (l) = [25, Section 1]. Since E p (ji) is positive for ji > 1, we have proved 
Lemma |4] ■ 



B. The Coding Scheme in Theorem @ 

Let Xf be the signal sent by node over A^ channel uses. Xf is the sum of codewords 
from M layers as shown below: 

M 
i=l 

Xft is the signal sent by the Sk in the zth layer. 

For each layer, we use the nested lattice code described in Section HO Let {Aj, A c i } be nested 
lattice pair assigned to layer i. This means A c i C Aj and A c i is Roger-good for covering and 
Poltyrev-good for channel coding. A construction of such a nested lattice pair can be found in 
[25, Section 7]. 

The signal Xf i is computed according to this nested lattice pair as: 

Xk,i = « + <i) mod A Cii fc = l,2,i = l,...,M (70) 

where df i is the dithering vector, uniformly distributed over V(A ci ), perfectly known by all 
receiving nodes and independently generated for each node, each layer and each block of A^ 
channel uses. Let uf i be the lattice point such that: 

uf ti e V (Ac,*) n A,, k = 1,2 (71) 

Note that both node Si and ^2 use the same lattice codebook for each layer. 
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Define Ri as the rate of the codebook for the ith layer: 

^i = ^log 2 |V(A C)i )nA i | (72) 

Since is uniformly distributed over V(A C) j), the average power per dimension of the ith 
layer Pj is given by: 

|2 



Pi — — r?rm — vr / Il x ll2^ x C73) 



1 

N vol(V(A c ,i)) Jxev(A c>i ) 
where vol(V (A c .j)) is the volume of the set V (A Cji ). 

As shown in (1231 ), Node receives gY"i, which, due to (1691 ), can be written as 



M 



qY 1 = E (g^fi + (p + 7) + gV&Zf (74) 
t=i 

When decoding layer z, we assume the decoder at node D\ starts from: 

E + (P + 7) + gv^Zf (75) 



Node £>! first decode qu^ + pu^ mod A C) j, then decode w^. 

Note that since p and g are both positive integers, qu^ + pu^i mod A c>i e A, fl V(A C)i ). 
In order to decode quf { + pu^ mod A ci , the decoder computes 



i-l 



% = [(<?< + P<d + I*"* + E (<Z*m + + 7) ^t) + gv^Zf] mod A Cji (76) 



from (1731 ) since it knows and dfV Note that in (1761 ), even though both the signal qu^ + 

i—l , \ 

puoi mod A ci and the noise term 7Xi v -+ X) (g^u + (p + 7) ) +QvbZ^ contains w^-, they 
are independent because is independent from due to the dithering vector d^. Hence 
Lemma [3] applies. Note that the same technique was used in [25, Lemma 2]. 
Define A* as 

i-l 

A = E (V + (P + 7) 2 ) Pt + q 2 b (77) 
t=i 

Then, according to Lemma [31 the probability that node Di does not correctly decode qu^ + 
pu^i mod A C j, decreases exponentially fast with the lattice dimension N if 



1, ( P 



After decoding guf i + p«2 i m °d ^-c,u node £) x can recover: 



i-l 



h x l + E (9^5 + (P + 7) * 2 ") + gv^Zf] mod A c , (79) 



f=i 
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from {75>. 

According to Lemma 0], as long as 

Pi > 7 2 P 4 + At 

(1791) equals 

f=l 

with high probability. If ( 1791 ) does not equal (|8TT) , a decoding error is said to occur. 
Node D\ then evaluates the following expression from (I8TT) : 



JV 



(80) 



(81) 



i-l 



k lK + E (QXZt + (P + 7) + gV&Zf 



t=i 



mod 7A C 



'i-l 



7<i + (*-!) + * £ + (p + 7) X») + gv^Zf 



a=i 



mod 7A C 



(82) 



(83) 



where the scalar k corresponds to a in [25, (13)]. Like [25], we choose k to minimize the 
variance per dimension of the term 



'i-l 



(k ~ 1) 7*£ + k( E + (P + 7) X") + qVbZ? 



(84) 



a=i 



(1841) is called the "effective noise" in [25] and its minimal variance per dimension with the 
optimal k is given by: 

T^Ta" (85) 
We then apply Lemma [3] which says node D\ can correctly decode from (l83l with high 

probability if 



• 1 , / 7 2 ^ 
i?i < - log 2 



( y 2 PiA % 



After decoding u^, node can recover the following signal from (1811) : 



i-l 



E(^S + (P + 7)X£) + gVfeZf 



(86) 



(87) 



which will be used when decoding lower layers. 

After describing the decoding procedure, we proceed to determine the power Pj and the rate 
Ri of each layer. As in [38], [43], we let the right hand side of (1781) equal the right hand side 
of ([861): 



Pi 

i 2 P l + A 



1 + 



i 2 P l 
A: 



(88) 
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It is easy to check that (1881) has the following solution^: 

Pi = aA (89) 

where a = 1 2-7 "^f 1 ^-. This leads to (T26l) in Theorem |4j For a to be real, we require 
1 — 4^ 2 > o, which means [7] < 0.5. 

By solving (1891) and (1771 ), we find that P is given by: 

Pi = a(a(3 + g 2 6 (90) 



where (3 = q 2 + (p + 7) 2 . This leads to (l27l) . 
For this power allocation, is given by 

A = (a(3 + l) 1 ' 1 q 2 b (91) 

Due to ([88]), P; can be found by averaging £78]) and <[86l) and substituting ([90]) and d9B into the 
result: 

=0.251og 2 (a) (94) 
It remains to check the requirement (|8Q|) . To do that, we substitute (l89l) into (f8~0l and get 

(l - 7 2 ) a > 1 (95) 



where a can be expressed in terms of 7 as shown in (1261) . It can be verified that the left hand 
side of (1931) is always greater than 1 if I7I < 0.5. Hence (f8~0l) is fulfilled. 

The total power consumed by node Si can be computed from (l90l) and is given by 

ffi- ^ + f -V » ,96, 

t=l M 

Having established the rate of the nested lattice codebook at each layer in (|94l) , we next compute 
the secrecy rate. For this purpose, we require that u^i, i = 1, M are independent and are also 
independent from u^i = 1, ...,M. Each u^i = 1, ...,M must be uniformly distributed over 



2 1_27 2 — — 47 2 

The other solution is when a = ^ ■ ^ turns out this solution does not achieve positive secrecy rate. 
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V (A c j) flAj. Then, if we view u^, k = 1, 2, i — 1, M as the inputs to the channel, the signal 
transmitted by node S 2 is independent between every block of jV-channel uses. Hence Theorem 
|2] applies. This means any secrecy rate R e such that 

< R e < [lim 4( J «> i = 1, M; Yf, d£, fc = 1, 2, i = 1, M)- 

/«, z = 1, .., M; yf, fc = 1, 2, z = 1, M))]+ (97) 



is achievable. Hence we only need to find a lower bound to the right hand side of (1971) . To do 
that, we first need to determine the distribution of u^^i = 1, ...,M. Here we choose it as the 
same distribution we chose for = 1, ...,M. This means u^^i = 1, ...,M are independent 
and each of them is uniformly distributed over V (A c ji ) H Aj. For this distribution, 

lim ^/«, z = 1, M; Ff, fc = 1, 2, z = 1, M) (98) 

1 M 

< lim -tf(<.,z = l,...,M)=E^ (99) 

jv— >oo iv — : 

On the other hand, we know that, for a given M, u^^i = 1,...,M can be decoded from 
{Y-f^jd^^k = 1,2, i = 1,...,M} using the decoding procedure described in this proof. By 
Lemma [3] and Lemma HI the probability of decoding error decreases exponentially fast with 
respect to N. If P e denotes the probability of decoding error, then, by Fano's inequality [46], 
we have 

«, i = l,..., M\Y», < 4 , k = 1, 2, i = 1, M) (100) 

1 1 M 

<- (l + P e H(u^,t = 1,..,M)) = - + PeE^ ( 101 ) 

Therefore 

lim z = 1, M; if, < i5 fc = 1, 2, z = 1, M) (102) 

= lim «, z = 1, M) — H «, i = l,..., M |Yf, < 4 , fc = 1, 2, i = 1, .., m)) (103) 

jV— >oo iv v 7 

AT 1 AI 

>E^"T7- P eE^ ( 104 ) 

t=l iV i=l 

By letting jV -> 00, d99j) and (11041) imply: 

-1 M 

lim ■-(/«, i = 1, M; yf, = 1, 2, z = 1, M) = E ^ (105) 

«=1 
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where E4 is given by (19"4l ). 

The second term in (l97l) can be upper bounded as follows: 

1/ «, i = 1...M ; >f, < i; fc = 1, 2, z = 1...M) (106) 

<lj «, i = 1...M; Xf ± Xf, d&, fc = 1, 2, i = 1...M) (107) 

1 M 



8=1 

<M (109) 

(11071) follows from the fact that telling the eavesdropper the channel noise will not harm its 
ability to obtain more information about the confidential message. (11081) is because the jamming 
signal X^i and the dithering noise of different layers are independent from each other. 
Finally, we apply (l51)-(fT0l) to each term inside the sum in (11081) to obtain (1 1 09b ■ 

Substituting ( 11051 ) and ( 11091) into ( 1971 ), we find that the following secrecy rate is achievable. 

M 

R e =[Y,Ri-M] + (110) 

i=l 

= [(0.251og 2 (a)-l)M] + (111) 
The secure degree of freedom is therefore given by 

M 

o [ERi-M]+ 
lim : - 6 = lim ^ rr (112) 

F-oo|log 2 P «-ooilog 2 E^l^ 



0.25 log 2 (a)-l 



(113) 



I log 2 + 1) 
Hence, we have proved Theorem HI 

Remark 15: can be replaced with deterministic vectors. The proof is given in Appendix 
□ 

Remark 16: We scaled the signal by k in (l83l) before performing the modulus operation. 
Doing so offers a slight gain in secure degree of freedom than just choosing k = 1. 

The scaling by k operation can also be done in (|76|) . However, the optimal scaling factor has 
a more complicated expression and it is difficult to derive an analytical expression for the secure 
degree of freedom if this approach is followed. □ 
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Appendix C 

Replacing Random Dithering Vectors with Deterministic Vectors 

In this appendix, we shall demonstrate that the dithering noise df k in Appendix |B] can be 
replaced with deterministic vectors. For a fix number of layers M, let ej denotes the binary 
random variable such that ej = 1 denote the event that a decoding error has occurred at layer 
i given no decoding error has occurred at upper layers j, M < j > i. There are three types of 
error events that results in = 1, depending on: 

1) whether node D\ can decode qu^ { + pu^i mod A Cji from % given by (1761) correctly. 

2) whether £79]) equals (f8Tl 

3) whether node D\ can decode u^i from (l83l) correctly. 

By Lemma [3] and Lemma HI the probability of each error event goes to exponentially fast as 
the dimension of the lattice N goes to oo. Hence from the union bound, we have: 

Pr(e,) < exp(-Nai) (114) 

where &i is the error exponent of the error event {e^ = 1}. Let a = muiiC^. Let e be the 
binary random variable such that e = 1 means a decoding error has occurred. Then {e = 1} = 
ufiQ 1 {e i = 1}. From the union bound, we have: 

Pr(e) < Mexp(-Aa) (115) 

Another way to view this result, as described in [35], is that (II 151) is the average performance 
of an ensemble of codebooks. Each codebook is denoted by C fc , k = 1,2 with parameter dj?^ i = 

l,...,M,k = 1,2, where 

Ck = {« + <*) mod A c>i : 6 V (A Cji ) n K, i = 1, .., M} (116) 

Each term inside the bracket on the right hand of (II 161) . which is [u k i + d^A mod A C j, denotes 
the codebook used for each layer. Although the transmitted signal is the sum of the codewords 
from the codebook of each layer, they can be differentiated by the receiver due to a careful 
power and rate allocation between layers as shown in Appendix El 

Note that because of the nested structure, (u^ + d^^j mod A C j contains the same number 
of lattice points regardless of the choice of d ki [35]. Therefore, C\. has same rate regardless 
of the choice of d ki . That said, this choice may lead to different average power and error 
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probability performance. ( II 151 ) can then be interpreted as the error probability averaged over the 
code ensembles and can be expressed as 

£ ClA [Pr(e)] < Mexp(-iVa) (117) 

where E Cl £ 2 l ] means the expectation considering the codebook Ci,C 2 as being random. From 
Markov inequality, at least (1 — e) fraction of the codebook C\ has the following property: 

^pr(e)|Ci = CI] < ^ Cl ,c 2 [Pr(e)] (118) 

Let P[ be the average power of a certain codebook Cj and P[ be the average power of the 
codebook ensemble. At least 2s fraction of the codebook C\ has the following property: 

Therefore, there must exists one codebook C* that meets (II 181) and (II 191) simultaneously. Let 
Ci = C{ to be this codebook, and apply the same argument above to C 2 . Then there must exists 
one codebook C 2 such that 

Pr(e|& = C*,C 2 = £*) < ^ C2 [Pr(e)|d = C{] (120) 

K < Y^2~ £ P " ° 21) 
Note that this is a consequence of the fact that C\ is independent from C 2 . Therefore conditioning 
on C* does not change the distribution of C 2 . Hence conditioning on C\, the average power of 
C 2 ensemble remains to be P 2 . 

Now choose e as a function of N such that 

lim lE CliC2 [Pr(e)] = (122) 

N^oo s 

lim e = (123) 

which is possible by letting e = 1/N, because for a given M, Ec 1: c 2 [P r ( e )] decreases exponen- 
tially fast with N as shown by (II 171) . From (II 181) and (11201) . we know for each choice e, there 
exists a pair of codeword C*,C 2 such that 

Pr (eld = CIC2 = C*) < \E Cl . C2 [Pr (e)] (124) 

and the power consumption meets the ( II 191 ) and (11211) . This, along with (11221) and (11231) . means 
that there must exists a pair of codebooks (C\ = C*, C 2 = C 2 ) that meet both the error probability 
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requirement and the power constraint requirement when N goes to oo. This means that the 
dithering vector {df k } in this work can be deterministic vectors. 

Appendix D 
Proof of Theorem [5] 

In this appendix we provide the proof for Theorem [5l We use the same integer lattice codebook 
as in [23, Theorem 1]. Let Ap £ be the scalar lattice defined as: 

Ap £ = {x : x = P 1/4+e z, z G zj (125) 

The codebook Cp £ is given by: 

C P<£ = Ap £ n 



P, y/P\ (126) 

where P = min{Pi,P 2 }. Hence — p 1 / 4 -^ < z < p l / A - £ 5 where [ ] and [ J denote the 
up and down rounding operations respectively. |Cp, e | > 2 ^P 1 / 4-2 — lj + 1 = 2P 1 / 4_e — 1. This 
means that, for large enough P, we can write: 

log 2 \C P , £ \ > log 2 (2P x / 4 - £ - l) > log 2 (P 1 / 4 -) (127) 

The same codebook is used by both nodes Si and S^- The codeword transmitted by node Si 
is chosen based on the secret message W\. The codeword transmitted by node S2 is chosen 
independently according to a uniform distribution over Cp £ . 

Since the input X 2 from S 2 is i.i.d., the channel can then be shown to be equivalent to a 
memoryless wiretap channel [3]. Similar to the proof of Theorem HI we use Theorem [2] with 
n — 1, which states any secrecy rate R e such that 

< R e < [I (Xi; Yi) -I(Xi; Y 2 )} + (128) 

is achievable. Hence to compute the achievable secrecy rate, we need to find a lower bound to 
the right hand side of (11281) . 

According to [23, Theorem 1], p(Xi) is chosen to be a uniform distribution over Cp )£ . Therefore 

P(X 1 )=log 2 (|Cp £ |) (129) 

From [23, Theorem 1], when 

p > ( i3 °) 

a z o z 
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we have 

H {X X \Y X ) < 1 + 2exp (~^pj !og 2 (fel) (131) 

The fact that ^/ab is algebraic irrational is used in [23, Theorem 1], which uses a result from 
Diophantine approximation. (11301) comes from [23, Lemma 2]. b in (11311) comes from the fact 
that the variance of the Gaussian noise contained in Y\ is b instead of unity. 
From (11291 ) and ( 11311 ), we have: 

/(*i;*i)> ^l-2exp^-^jjlog 2 (|C P , £ |)-l (132) 

Since Cp e is simply a scaled and shifted version of the integer lattice code, we can use Lemma 
□ to bound I(X 1 ;Y 2 ): 

I (Xi; Y 2 ) < I y 2 , Z 2 ) = / (Xi; X ± X 2 ) = f(Q) < 0.8 (133) 

where f(Q) is defined in (1281) . Q is determined by the range of z in (11251) . Hence Q = 
2[P 1/4 ~ e J + 1. The last inequality in (fT33l) follows from Lemma [IJ Using (fT32l) (fl33l . and 
(11271) . we find (11281) is lower bounded by 

[ f 1 - 2 exp ( VI - e) log 2 (P) - 1 - /(g)] + (134) 



8b J J V4 

for sufficiently large P. Since < f(Q) < 0.8, and e can be any value between (0, 1/4), using 
the definition in (f2~Tj) . we find the achieved secure degree of freedom is 0.5. Hence we have 
completed the proof. 

Appendix E 
Proof of Theorem [6] 

In this appendix we provide the proof for Theorem [6] We begin by considering the case when 
\foh > 2. Let 

Af-l 

x k = VP) E a k , t Q 2 \k = 1,2 (135) 

where M is a constant positive integer, P is a constant scaling factor. Both are related to the 
variance of X k . is uniformly distributed over [0, [QJ — 1] HZ. Due to the range limit imposed 
on ajfei, we observe that a^j is uniquely determined by X k . 
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The signal received by node D x is given by 

M-l M-l 

Yi = y/P { E a ^ + E «2,Q 2l+1 ) + VbZ, (136) 

We then derive a lower bound to [I(Xi, Y\) — I(X 2 ; Y 2 )] + as we did for Theorem [5l 
I(Xi]Yi) can be lower bounded by following a similar derivation from [23, Theorem 1]. 

M-l 

Define C fc = J2 CLkiQ 2t '■ a-ki £ [0, [<5J — 1] H Z}, fc = 1, 2. We use the same maximum 

i=0 

likelihood decoder used in [23, Theorem 1]: 

Yi = axg min \Y ± - {X x + QX 2 )\ 2 (137) 

Xi+QX 2 , s.t. x fe ec fe ,fe=i,2 

It is clear that given Y" l5 there is a unique pair of X X ,X 2 such that X\ + QX 2 = Y"i. Let this 
mapping from Y\ to X\ be /. Define binary random variable A such that 

f if iVfcZil < v^/2 
A= ^ (138) 
1 otherwise 

Note that if A = 0, we have X\ = /(Yi). For this definition of A, we have 

H(X 1 \Y 1 ) <#(Xi,A|Yi) (139) 

=#(A|Yi) + #(x 1 |Y 1 ,A) (140) 

<1 + Pr (A = 1) # (Xi|fi, A = l) + Pr (A = 0) H (Xi|Yi, A = 0) (141) 

= l + Pr(A = l)i/(Xi|Yi,A= l) (142) 

<1 + Pr(A = l)H{X l \A = 1) (143) 

=1 + Pr(A — l)H (Xi) (144) 

(11441) is because A is independent from X x . Pr(A = 1) is bounded as follows: 

Pr (A = 1) = / --L= exp ( -VI dt<2 exp — ) (145) 



Substituting it into (I144|) . we get: 



H(X 1 \Y 1 ) <l + 2exp(-^)H(X 1 ) (146) 



Therefore I(X\\ Yi) is lower bounded as: 

> I 1 - 2 exr> I - 

86 



I{X 1 ;Y 1 ) >I{X 1 ;Y 1 ) > ( 1 - 2 exp ( -§ ) ) H (X 1 ) - 1 (147) 
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For I(Xi\ Y 2 ), we have: 



M-l 



I(X i; Y 2 ) < J(X i; X 1 ±X 2 )<J2 Haiti ± = Mf([Q\ ) (148) 

i=0 

Given Xi as defined in (11351) . we have H(X X ) = M\og 2 [Q\. Substituting it into (11471) and 
combining it with (11481) , we find, from (11281) , the following secrecy rate is achievable. 

R e = [M(l - 2exp(-^))(log 2 LQJ) - 1 - Mf([Q\)] + (149) 



The transmission power is given by 







Var [X k ] = P„ A 9 { 2 ' ) £ Q 4i (150) 

The secure degree of freedom can then be computed by substituting the transmission power 
(11511) and the secrecy rate (11491 ) into (1211) . which yields: 

lim [((i-2exp(-f))io g2 Lgj- / (Lgj ) )M] + 

(11531) can be made arbitrarily close to (l32l) by choosing a large enough P . (1331) then follows 
from (|32|) via Lemma CD 

When Q = 2, it can be verified that /(|_QJ) = 0.5, and (11531) can be made to be arbitrarily 
close to 1/4. 



The case of l/yab > 2 can be proved in a similar fashion. Let l/yab = Q and let X k = 

M-l 

QVPo 2 CLkiQ 2 \ k = 1,2. Then all previous derivations apply. In particular, (11361) becomes: 

8=0 

M-l M-l 

Yi = y/Poi J2 <*i,iQ 2i+1 + E a ^Q 21 ) + ^i (154) 

i=0 i=0 

and (11371) becomes: 

Yi=arg min \Y X - (X, + ^-X 2 )| 2 (155) 

Xi+x 2 /Q, s.t. x fc ec fc ,fe=i,2 <5 



3 In case it is desired for Xk to have zero mean, we can simply shift Xk by a constant, which will not alter the secrecy rate. 
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The achieved secrecy rate remains the same. The transmission power is scaled by Q 2 : 

V^[X k } = Q 2 P (^f^j^^, fc = l,2 (156) 
Hence the secure degree of freedom is still given by (1331) when Q > 2 and 0.25 when Q = 2. 

Appendix F 
Proof of Theorem [7] 

Let X k be given by (11351) in Section |E] with Q = 2. However, unlike what we did in (11351) . 
Ofe j is not uniformly distributed over {0, 1}. Instead, we choose its distribution to maximize 

I{a\,i] ai,i + a 2 ,i) - I{a\,i] a\,i - a 2 ,i) (157) 

which is about 0.1095 when Pr(a M = 1) = 0.1443, Pr(a 2ji = 1) = 0.8557. We next derive 
the achievable secrecy rate by deriving a lower bound on [I(Xi,Yi) — 7(X 2 ;Y" 2 )] + . Define 

_M-1 

C k = {VPo £ a k)l Q 2i : a Kl G {0, 1}}, k = 1,2. Define /fa) as 



i=0 



Then: 



^ = K « J r 1+ ^ W l yi -^ + ^l (158) 



7(X 1 ;X 1 + X 2 )-7(X 1 ;/fa)) 

</(X i; X 1+ X 2 |/fa)) (159) 

<77(X 1 + X 2 |/fa)) (160) 

<l + 2exp(-||W(X 1 +X 2 ) (161) 



86 

Inequality (11611 ) follows from (|139D - (I144I) with % replaced by f(Y 1 ) defined in (11581 ), and Xi 
replaced with X 1 + X 2 . Then we have 

HX 1 ;Y 1 ) 

>J(Xi;/fa)) (162) 

D) - 

S6, 



> ( 1 - (2exp ( ) ) ) 77 + X 2 ) - H{X 2 ) - 1 (163) 



/ (X i; X 1 + X 2 ) - 1 - 2 exp 77 (X x + X 2 ) (164) 

A/-1 , p v M-l 

^ 7 (oy; ai,i + 02,t) - 1 - 2 exp ( — -J ) ^ 77 (a M + a 2)i ) (165) 
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In (11631) we use the fact that X\ is independent from X 2 and apply the result from (11611) . ( 11651 ) 
is because there is a one-to-one mapping between X 1 + X 2 and {a\ yi + a 2:i , i = 0, M — 1}. 
For I(Xi, Y 2 ), we have 

I(X 1 ;Y 2 )<I(X 1 ;X 1 -X 2 ) (166) 

M 

<J2 I ( a ^ a ^i~ a 2,i) ( 167 ) 
i=l 

Therefore 

I (X^Y!) - I {XnY 2 ) 

AI / P \ M ~ l 

>J2(I (a 14 - a hi + a 2 ,i) - / (a M ; ai,< - a 2;i )) - 1 - 2 exp ( ) V # (a^ + a 2|i ) (168) 
i=i v m/ t=0 

=M0.1095 - 1 - 2 exp M# (on + a 2 ,i) (169) 



86 

Let K fc be the variance of a k i. Then we have 



4M - 1 

Var [X k ] = PoVa, k x , k = 1, 2 (170) 

Hence the secure degree of freedom is given by: 

lim (171) 

m^oo Il 0g2 ^ar[X fc ] 

= [0.0548 - exp (a M + a 2ii )] + (172) 

We can always choose a large enough P to make the secure degree of freedom to be arbitrarily 
close to 0.0548. This completes the proof of Theorem |7J 



Appendix G 
Proof of Theorem [8] 

Let Xi be: 

3M-1 

x k = \/Po E a k ,iQ i ,k = 1,2 (173) 

i=0 

where Q = 2. M is a positive integer and P is a positive constant as defined in the proof of 
Theorem [6l Choose a^i such that 

a k i = 0, if % mod 3 = 1 or 2 (174) 
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otherwise is uniformly distributed over {0, 1}. Here a^j is forced to be zero at i mod 3 = 
1 or 2 to make room for 0.5a 2j -, Vj mod 3 = and the carry-over from a x j + a 2 j, Vj mod 3 = 0. 
Define as the collection of points X k defined by (11731) . Define Y\ as 

Fi = arg min \Y X - (X 1 + 1.5X 2 )| 2 (175) 

X 1 +1.5X 2 , s.t. X fc eC fc ,fc=l,2 

With this new definition of i^, the same derivation that leads to (11471) applies, where H(Xi) = 
M. On the other hand: 

3A/-1 

/ (Xr, Y 2 ) <I(X 1 ;X 1 ±X 2 )< £ / (ai,il ai,i ± a 2 ,i) = 0.5M (176) 
From (11471 ), (11761 ) and (11281) , the following secrecy rate is achievable: 

8b, 



1 - 2 exp I -§ ) ) M - 1 - 0.5M 



(177) 



The transmission power is given by 



^w-^S« m -tW (178) 



4 

for Q = 2. The secure degree of freedom is hence given by: 



;i-2ex P (-f))M-l-0.5M 

llm h ( 179 ) 

A/— »co I log 2 (Q6A/ ) 

/ 1 - 2 exp (--£) - O.5) (180) 
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which can be made arbitrarily close to 1/6 by choosing a large enough P - 

Appendix H 
Proof of Theorem [9] 

We prove Theorem [9] by showing that the loss in secrecy rate due to the imperfectness of 
channel state information can be bounded by a constant. 

The challenge in proving this result arises due to the fact that the assumption on a { essentially 
makes the channel in Figure [10] an arbitrary wiretap channel for which an explicit calculation 
of the secrecy rate is difficult [47]. To work around this problem, we consider the channel in 
Figure [[2] and present the following Lemma. 

Lemma 5: Any secrecy rate achievable in Figure [l2l is also achievable in Figure [10] with the 
same coding scheme. 



46 




Proof: We need to show the confidential message W\ is reliably received by D\ and the 
secrecy constraint (fl4l) is met. The first requirement is met automatically since that the signal 
received by D\ remains the same. Hence we can start with a coding scheme that reliably transmits 
W\ from Si to Di for Figure [12] and prove that the secrecy constraint (TBI) is fulfilled with this 
coding scheme for Figure \\0\ For the coding scheme, the layered nested lattice codes from 
Theorem [4] is used. 

Let n be the total number of channel uses. Let D be the nxn diagonal matrix, whose diagonal 
element at the zth row and ith column is c^. Define a diagonal matrix D such that D is obtained 
from D by replacing the elements on its diagonal line with « max - Define Z£ as independent 
random variable with the same Gaussian distribution as Z 2 . Z™,i = 2,3 are independent. Then 
the mutual information between W\ and the knowledge of the eavesdropper in Figure [10] can be 
upper bounded as follows: 



</ (Wi; Y 2 n , <% ti , k = 1, 2, i = 1...M, D) (181) 

<I [Wi] Xf ± X™ + Zl + DX™, dj*,, k = 1, 2, i = 1...M, d) (182) 

=/ (W i; X? ± X 2 n + Z 2 n + DXf, D, d^, k = 1, 2, i = 1...M) (183) 

<J (Wi; Xf ± X 2 ", Z 2 " + DAT, A fc = l,2,i = 1...M) (184) 

<I (Wi- X™ ± X 2 n , Zl + DX™, D, d^, fc = 1, 2, z = 1...M) (185) 



\ 



2 ZJ + DXf, fi, <e„ i = 1, 2, i = 1...M I 



a 2 

max 
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<I ( Wi; X? ± X 2 ", — + M?, D, d n kp k = 1, 2, i = 1.. V) (187) 

=/ (Wi; Xf ± X 2 n , Z 2 " + a max X?, D, d^, k = 1, 2, i = 1...M) (188) 

Note that (11881 ) is exactly the same as the mutual information between W\ and the eavesdropper's 
knowledge in Figure [[2l Hence if 

lim —I (Wi, X? ± X 2 n , Zl + cwXf, ^, dfci, = 1, 2, z = 1...M) = (189) 

n— >oo 77, V ' y 

Then we must have: 

Jim -/ (Wi; Y 2 N , d£, k = 1, 2, i = 1...M, £>) = (190) 

This means that to obtain the secrecy rate for Figure [Kjl we can as well compute the secrecy 
rate for Figure [T21 ■ 
Note that with the layered nested lattice coding scheme in Theorem when each lattice point 
is viewed as a single channel use, and the channel in Figure [12] is equivalent to a memoryless 
wiretap channel. Then according to Theorem [21 the following secrecy rate R e is achievable: 

< R e < [ lim ^(/«, i = 1, M; , d£, k = 1, 2, i = 1, M)- 

z = 1, M; Xf ± Xf , a max Xf + Zf, d£, fc = 1, 2, z = 1, M))] + 

(191) 

The first term in (11911) is still given by (11051) since the signal received by D\ remains the same. 
The second term in (11911) can be upper bounded as follows: 

1/ «, z = 1...M; Xf ± Xf, Z$ + « max Xf , d£, fc = 1, 2, i = I...M) (192) 

=^J «, i = 1...M; Xf ± Xf, Z» + « max Xf |d£, fc = 1, 2, i = I...M) (193) 

<1/ z = 1...M; Xf ± X^dJ, fc = 1, 2, z = 1...M) (194) 

+ 1/ z = 1...M, Xf ± X»- Z$ + a max Xf |d£, fc = 1, 2, z = 1...M) (195) 
=-Lf «, z = 1...M; Xf ± XfldJ, fc = 1, 2, z = 1...M) 



+ i-J (<, z = 1...M; + « max Xf |d* , fc = 1, 2, z = 1...M) (196) 
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(fl96l) is because {Xf ±X? } - {uf^i = l...M,d% ii k = l,2,i= 1...M} - {Z* + a max Xf} is 
a Markov chain. The first term in (11961) is shown by (I107l) - (|109l) to be bounded by M. Hence 
we only need to bound the second term. This term is bounded by: 

1/ «, k = 1, 2, i = 1, M- Z» + « max Xf ) (197) 

<lj (Xf ; + a max Xf) (198) 

<C(«Lx^) (199) 

where C(x) = | log 2 (l+x) is the channel capacity formula of average power constrained AWGN 
channel. P ls the total transmission power consumed by Si, is given by (l96l ). The inequality ( 11991 ) 
is due to the fact that for additive channel with i.i.d. Gaussian noise the mutual information is 
maximized by Gaussian input distribution for given transmitter power constraint. 

Applying (11991 ), (I107l) - (|109l) , and (11051) in (11911 ), we find the secrecy rate is given by 

R e = [(0.251og 2 (a) -1)M-C («Lx^)] + (200) 

Compared it to (|1 1 1|) . the loss in secrecy rate is bounded by C (a^ ax Pi), which is bounded by 
a constant per the condition stated in Theorem HI Hence the achieved secure degree of freedom 
remains the same as Theorem HI 
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